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Abstract. A simplification of Apery's proof of the irrationality of £(3) is presented. 
The construction of approximations is motivated from the viewpoint of 2-dimensional 
recurrence relations which simplifies many of the details of the proof. Conclusive evi- 
dence is also presented that these constructions arise from a continued fraction due to 
Ramanujan. 
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O ■ 1. Introduction 

(D ; 

Q . In 1978, R. Apery p] presented his famous proof of the irrationality of C(3). His method 

| involved the explicit construction of two solutions a n and b n of the recurrence 

(1) (n + l) 3 w n +i = (34n 3 + 51n 2 + 27n + 5)u n - n 3 w n _i, 

| for n > 1 such that a n /b n — > ((3) as n — > oo. These solutions also satisfied the arithmetic 
properties b n , [1, 2, . . . n] 3 a n G Z. Put together, these facts turned out to be sufficient 
to complete the proof of irrationality of C(3)- For an account of the history and the 
"miraculous" nature of this construction, see van der Poorten [7|. 

Many proofs of the irrationality of £(3) have followed since then, all of which construct 
the same sequences, a n and b n , by vastly different methods (see Fischler [2] for a survey). 
One of these proofs is by Apery himself [2] (arguably his only complete proof of this 
result). This paper deals with a method of interpolation for continued fractions and 
constructs a series of continued fractions for £(3) from which the sequences a n and b n are 
obtained. We also remark that none of these proofs have generalisations to higher zeta 
values. For example, it is still unknown whether £(5) is irrational. 

In Vol. 2 of Ramanujan's notebooks, Berndt [5] suggests that a certain continued 
fraction of Ramanujan is related to the proof in [2]. Recall that the Hurwitz zeta function, 
C(s, x) is defined for Re s > 1, Re x > by 

•I— I OO j 

c(s,x) = y2~ — ■ — -• 

H ■ ^— ' [n + x) s 

& . n=o K ' 

The continued fraction of Ramanujan in consideration ([S], Entry 32(iii), p. 153) is 

1 — l 6 — 2 6 — 3 6 

(2) C(3 ' X + X) = P(0,x)+ P(l,x)+ P(2,x)+ P(3,x) + 

for Re x > —~ where P(n,x) = n 3 + (n + l) 3 + (An + 2)x(x + 1). In the discussion 
following this entry in [5], it is stated that the specialisation x — 1 yields a continued 
fraction for ((3) which is "of crucial importance" in the work of Apery [2]. 

Around the same time, F. Apery [3] in a biographical note on his father R. Apery, states 
that the construction in [1] is motivated from a "number table due to Ramanujan". 
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In this note, we take the view that a more detailed analysis of the method in [2] leads 
one to the conclusion that the constructions are indeed based on Ramanujan's continued 
fraction ([2]). We present here such an analysis, which also allows us to give a simplified 
proof of Apery's result, which we state as 

Theorem. ((3) is irrational. 

This note is organised as follows. We first present the constructions of [2] from the 
viewpoint of 2-dimensional recurrence relations in Sec. 2. Section 3 is devoted to the proof 
of the theorem. In Sec. 4, we end with some concluding remarks on the comparison with 
Apery's approach, the relation to the Ramanujan's continued fraction and generalisations 
to other constants. 

2. Construction of the tables 
We start by defining the homogenous polynomials 

(3) /(/../) / :i • -n-j ■ ■>;/ ■ /. 

g(i,j)=i 3 -2i 2 j + 2ij 2 -f. 

We identify here the key properties of these polynomials that will be used in the con- 
struction. They are 

(4) f(i,j)g(t,j) = f(t,0)g(t,0) + f(0,j)g(0, 3 ), 

(5) f(i + l,j) - f(ij + 1) = g(i + 1, j + 1) - g(i,j). 

Now we present the 2-dimensional recurrence which will play a central role in our 
construction. 

Proposition 1. The recurrence 

(6) (ffci\ 9{0J)\ ( \ = f{i Q) ( V 

for integers i,j>l has a rational valued solution Uij for each of the following boundary 
conditions 

(a) Vi, j > 0, u 0d = u 0) i = 1, 

(b) Mo,o = and Vi, j > 1, 

= S Tikn) II ~mW = n 3 ' 

n<j k<n n<j 

ui,o = ^2 /(^o) n fko) = s n 3 - 

n<i k<n n<i 

Proof. We first derive additional conditions that any solution of ()6]) has to satisfy. The 
top entry on the right in ()6]) for i,j is the same as the bottom entry for i,j + 1. Hence 
the solution has to satisfy the recurrence 

(7) /(0, j + l)ui-i J+1 = (f(i,j) - g(i,j + l)K-i,j + #(0, fiin-tj-i. 

This is a condition on the solution for row i — 1 with i > 1. Next by inverting ([6]) and 
using property (j3J), we get 

(8) ( fyfi ( U ^ ) = ,(,,0) ( U ^ ) . 

This equation likewise leads to the condition that u^j satisfies 

(9) f(0,j + = (/(*, j + 1) - g(i,j))u itj + g(0,j)uij-i. 
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This is a condition on the solution for row i, which, by property flS]), is the same as (|7|) 
with i — 1 replaced by i. 

Conversely, the recurrence fl6]) can be used to construct row i from row i — 1 in a well- 
defined manner, if ([7]) is satisfied for row % — 1. We will use this observation, inductively 
along the rows, to construct the required solutions. 

First it can be easily verified that both of the given boundary conditions (a) and (b) 
satisfy ([7]) for the row i = 0. This will be the base case. 

We now assume that we have constructed the solution upto row i — 1 and that the row 
i — 1 satisfies (jTJ). This implies that the recurrence (jH]) can be used to construct row % 
from row i — 1 in a well-defined manner. Hence ([8]) holds and by the discussion above, 
we can then conclude that row i also satisfies ([7]) and the induction step is complete. 

Hence there exist solutions to fl6]) satisfying the given boundary conditions (a) and (b) 
along the row i = 0. The only step remaining is to verify that these solutions satisfy the 
respective boundary conditions along the column j = 0. This can be verfied by using fl^]) 
and (JED to get 

ff{i,j) -g(i,0)\ ( Uij-i "\ _ f(n -x ( Ui,j \ 
\f(i,0) -g(i,j)J U-y-J ~ J{ ,J) WW ' 
This gives us the condition 

f(i + 1, 0)u i+ltj -i = (f(i,j) +g(i + 1, - g(i, 0)«i_ li5 -_i. 

Now it can be easily verified that the given boundary conditions are solutions of this for 
j = 1 and the initial values u^o and m ,o agree with our previous construction. □ 

Call the solutions to ([6]) corresponding to the initial conditions (a) and (b) of Prop. Q] 
as q it j and respectively. Now we explore the arithmetic properties of these tables of 
rational numbers. Here we use an additional property of f(i,j), namely 

(10) /(0,x),/(x,0)G{x 3 ,-x 3 }. 

We shall use the notation d n = [1,2,... n] for the rest of this note. 

Proposition 2. Any solution ho/Q has the property that Uij is a Z-linear combination 
ofui-ij, ik,j-i and Ui-ij..! fori,j > 1. Hence q id and d 3 max{i j) pij are integers. 

Proof. For the proof of the first statement of the proposition, we start with the assumption 
that the gcd = 1. By (fit)]) , this means that the gcd (/(0, j), f(i, 0)) = 1. Hence, 
there exists an integer x such that 

f(i,j) = xf(0,j) mod/(z,0). 

Multiplying this by g(i,j), we get 

f(hj)g(hj) =xf(0,j)g(i,j) mod f(i,0). 

Using 01]) makes the left side = f(0,j)g(0,j) and cancelling f(0,j) from both sides 
gives 

g(0,j) = xg(i,j) mod/(i,0). 

Hence, by subtracting x times the second row from the first row in we get coef- 
ficients which are divisible by f(i, 0). Thus we conclude that Z-linear 
combination of Ui-ij and Since a; is an integer, we get the first part of the 

proposition, for the special case = 1. 
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The general case — d is handled by calling %' = i/d, f = j/d, dividing <^ by d 3 
and using the homogenity of f(i,j) and g(i,j) to get 

/(*',/) 9(0, f)\ ( \ _ m ( Uij 

f(0,f) 9(i',f)J Wu-i) ~ n ' J W-i 

This reduces to the previous case as (i',f) = 1 and we proceed as before and complete 
the proof of the first statement of the proposition. 

For the second statement of the proposition, we use the first statement recursively to 
obtain that 11% j IS cL Z-linear combination of m ,o, 1*0,1, ■ ■ ■ u j, «i )0 , «2,o, • • • «i,o- Thus the 
second statement on qij and pij follows from the arithmetic properties of the boundary 
values (a) and (b) in Prop. HJ D 

Now, we will prove that Pij/qij converge to £(3) uniformly in % and j. For simplicity 
of presentation we shall use one more proerty of /, g namely 

(11) f(i,j) - f(o,j) > g(i,j) - g{o,j), i,j > 1. 

We define the table e it j as 

(12) = ?ijC(3) -Pij. 
and prove 

Proposition 3. The table e^j — > uniformly as i,j — > 00. 

Proof. First we note that condition (jlip implies that for 2 > 1, 

f(i,j)x + g{0,j) > f(0,j)x + g{i,j). 

Using this in (JBJ with 2 = qi-ij/qi-ij-i, gives the implication gj-ij > Q'i-ij-i =>■ %j > 
Hence, we conclude that qij is monotonically increasing along rows i > 1. 
Define the following quantities: 

= ~ Pi-l,j-lQi-l,ji = Pi,j-lQi-l,j-l — Pi-l,j-lQi,j-l- 

Now, we take the second row of (jSJ) for both pjj and gij, multiply by (7i_i j_i and 
resp. and subtract to get 

(13) fsv™ = i 3 6% 

Similarly using the second rows of © gives 

(14) = 

Using ffT3]) and (IT41) recursively with the initial values = j~ 3 (verified directly) gives 
us Slf = j~ 3 and 5f°- = i~ 3 for z, j > 1. 

Now, we deduce the difference in the ratios r^j = Pij/qij along the columns, 

' i,j ' t— lj -3 ) 

Qi,jQi—l,j 1 Qi,jQi—l-,j 

tends to for i fixed and j — »■ 00. This, coupled with the fact that r 0j - — >■ C(3) (from the 
boundary conditions in Prop. 1), implies that each row in r^j has ((3) as limit. 
As for the difference in r^- along the rows, 



Hence for any i,j we have 

lC(3)-rd<4Ep- 

%i k>j K 

Multiplying by q^j on both sides, we get that je^l < C(3)/<??.,j which tends to zero uni- 
formly in i,j by montonically increasing integers q^j (see Prop. 2). □ 

3. Proof of the theorem 

For the proof of the theorem we shall use the criterion that a number a is irrational if 
there exists a sequence of integers a n and b n such that 

(15) ^ \a n — b n a\ — > as n — > oo 

For, if not, let a = r/s with coprime integers r,s. Then the modulus of the Z-linear 
form in 1 and a in (fl5|) is either or > 1/s for any choice of integers a n and b n . This 
contradicts (TT5]) and hence a is irrational. 

The linear forms in 1 and £(3) needed to use the above criterion will come from the 
diagonal e n) „. For estimating the decay of these forms, we shall need Poincare's theorem in 
vector form. For a discussion of the history of this theorem and references, see Aptekarev 
et. al. [1], Ch.3, Sec.l. Poincare's theorem is usually used in the simpler setting of 
1-dimensional recurrence relations and the following (|4J, pp.1104) is a generalisation. 

Proposition 4. (Poincare-Perron, in vector form) Let f 1 = (x™, x%, ■ ■ ■ be a sequence 
of k- dimensional vectors which is a solution of 

(16) x n = A n x n ~\ 

where the vectors are taken to be column vectors and A n is a k x k matrix. Let A n — >■ A 
as n oo, where A is a diagonalizable matrix with eigenvalues of distinct magnitude. 
Then, either xT 1 = eventually or there exists a component j of f 1 such that 

x n x n 

(17) lim \ = A, and lim — — e, 

n->oo Xj n— >oo Xj 

where (A, e) is an eigenpair of A. 

If the system (fT5"|) is nondegenerate (A n is nonsingular for alln), then for any eigenpair 
{\ e) of A, there exists a solution xf 1 of ffTB]) and component j such that (|17[) holds. 



We use PropJUby defining x" = (e n ,n+i! e n,n) an d 



6n 3 +9n 2 +5ra+l 



'18) An = i ("+ 1 ) 3 ("+ 1 ) 3 , , 

i o J V 1 ° 



6 -1 



We note that the difference equation ffl6|) is satisfied. This is because, by the definition 
f[T2"|) of e it j it is also a solution of the recurrence (jH]) of Prop. 1. The second matrix 
on the right in ffTS]) is from (E]) with i — n,j — n which transforms (e n _i jn , e n _i jn _i) 
to (e n ,n ; £n,n-i)- The first matrix is from ([7]) with i — n + l,j — n which transforms 
(en,n,e n , n _i) to (e nin+ i,e njn ) as required. 

We have A n — >■ A where 

'35 -6" 
6 -1 



.4 



with the eigenvalues of A being 17 ± 12a/2 and the corresponding eigenvectors being 
(18 + 12^2,6). 
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Now, we apply Prop. H]and note that the system is nondegenerate for n > 1. Hence, 
the eventually zero case cannot occur because the vector (ei 2 , ex,i) is not the zero vector. 
Thus, there exists j and an eigenpair (A, e) such that (IT7|) holds. 

Note that in (fTTl) . we can always choose j to be any of the choices where e 3 ■ ^ 0. Since 
neither of the eigenvectors of A has any zero entries, we can conclude that ffTTl) holds for 
both values j = 1,2. We choose j = 2 and get 

(19) |en,n|=e"( logA+ °«), 

where A = 17±12\/2. However, A = 17+12^ is not possible because that will contradict 
Prop. [3J Hence, A = 17 — 12\/2 in the decay estimate (IT91 . 

Recall that the prime number theorem implies that d n = e™^ 1+0( - 1 ^ ([7], p. 198). From 
Prop. [2J we obtain that d? n p n ^ n and q n>n are integers. Hence, the decay estimate for d\e n>n 
is 

(20) K?n,nC(3) - tfc.nl = e" (3+1 ° g A+ ° (1)) . 

omce log(17 - 12\/2) < —3, we conclude that the Z-linear form in 1 and £(3) in fTSOj) 
is nonzero and tends to as n — > oo. By the criterion (fl5|) . we conclude that £(3) is 
irrational. 



4. Concluding remarks 

Comparison with Apery's approach: 

At this point, we would like to stress that the construction of the tables pij and q^j is 
entirely from [2] and it is Apery's method that we have adapted for the proof of Prop. 
1. The only novelty in our approach is in viewing these tables as the solutions of a 2- 
dimensional recurrence, whereas in [2] they appear as a sequence of continued fractions 
(indexed by row). 

The advantage for us is that the proof of the arithmetic properties in Prop. 2 is 
natural and elementary, whereas in [2], a proof for qij e Z is given using differential 
equations and there is no indication of the proof for the properties of pij. This is a 
reasonable achievement for our approach, given that these properties are considered deep 
("a fundamental miracle" [7], p. 202). Proposition 3 is also an easy consequence of the 
recurrence (jH]), while there is not even a mention of this fact in [2J. 

The last step in j2j is the claim that the "diagonal" sequences p^i and q^i satisfy the 
recurrence ([1]) and hence the (usual) Poincare's theorem gives the irrationality of C(3)- 
We remark that his claim can easily be verified from our recurrence relations. We have 
preferred to follow the vector based approach because it is more natural and the vector 
form of the Poicaree's theorem is not more difficult than the 1-dimensional version (see 

Relation to Ramanujan's continued fraction: 

Now we sketch a proof that Apery's approach of viewing each row as a continued 
fraction recovers the intimate connection with Ramanujan's continued fraction (J2J) - First 
we note that along each row where % is fixed, p^j and satisfy (J7|) (with i — 1 replaced 
by i). Hence the recurrence satisfied by jl 3 Pij and j\ 3 qij is 

(21) u iJ+1 = P(j, i)uij - j 6 Uij-i, 

for j > 1. Here we used the fact that f(i + l,j) — g(i + l,j + 1) = P(j,i) where 
P(j, i) = j 3 + (j + l) 3 + (4j + 2)i(i + 1) is the same polynomial appearing in (T5]). 
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Hence if we define the continued fraction u(i) by 

-l 6 -2 6 -3 6 
~ P(l,i)+ P(2,i)+ P(3,i)+ 
then from the theory of continued fractions, we can obtain 

(22) C (3) = ^ ( ; Ko+Ki . 

wW?i,o + %i 

We recall that q it o = 1 and p it0 = J2 n <i n ~ 3 - ^ * s a ^ so eas y ^° obtain, using the methods 
of Sec. 2, that q^i = P(0,i) and = P(0,i)p i0 + 1. Putting all these values in (1221) . 
we get exactly the identity of Ramanujan (j2j) specialized at x = i. 

Now, in principle, Carlson's theorem can be used to derive (j2J) for the entire halfplane 
Rex > —1/2 from the equality at all positive integers, if we show that the growth of 
the continued fraction is sufficiently slow (the Hurwitz zeta function is bounded in this 
range) . 

Thus, we see that the point of view in |2j gives a clear link with ([2]) and bolsters the 
claim that Apery's constructions were indeed motivated from ([2]). We must mention at 
this point that the idea of using fl2]) to produce such a startling diophantine application 
is a testimony of the genius of Apery. 

Generalisations: 

As a final remark, these methods can be used as in |2j to show that log 2 and £(2) 
are irrational by suitable choices for f(i,j) and g(i,j). For log 2, the choice f(i,j) = 
i + 3 j g(hj) = i — j satisfies all the conditions specified in Sec. 2 and yields a proof of 
irrationality as in Sec. 3. This is indeed related to continued fraction Entry 29 (Cor.) of 
[5J. Similarly for C(2), the choice f(i,j) = i 2 + ij + \j 2 , g(i,j) = —i 2 + ij — \j 2 yields 
a proof of irrationality and is related to Entry 30 (Cor.) of [5]. Note that ([TO"]) is not 
satisfied in this case, and more work is needed to establish the analogue of Prop. 2. 

However, there exist no nontrivial homogenous polynomials f(i,j), g(i,j) of degree 
> 3 which satisfy the conditions (jl]) and ([5]). Since these conditions are crucial in the 
construction, we conclude that our approach, in its current form, fails for higher zeta 
values. One may need to look beyond 2x2 matrix recurrence relations to attack these 
constants! 
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